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Single-Integration, Adaptive Delta Modulation 

By P. CUMMISKEY 
(Manuscript received March 11, 1975) 

An estimate of optimum performance is derived for a single-integration, 
adaptive delta modulator. Several simulations of adaptive delta modulators 
with single integrators have all produced signal-to-noise ratios near or 
below the estimate. 

The derivations presented here indicate that the performance of a single- 
integration delta modulator is dependent on the correlation between adjacent 
samples of the input signal and on the probability density function of its 
derivative. The relationship between the probability density of the derivative 
of the input signal and optimum performance, in turn, explains why 
signal-to-noise ratios taken on sine waves are greater than those recorded 
while processing speech signals. 

I. INTRODUCTION 

In this paper, an equation is derived for the optimum signal-to-noise 
ratio (s/n) of a single-integration, adaptive delta modulator. Mean- 
square quantizing noise is a mathematically tractable quantity which 
appears to be a reasonably good measure of overall performance. It 
was felt that an understanding of the relationships between this 
quantity and the character of the input signal would be useful. The 
derivations and data presented here all contribute to this end. Other 
practical considerations, such as subjective evaluation, 1 transmission 
errors, 2 and tandem encoding, 3 have been discussed elsewhere. 

Several simulations 4 of single-integration, adaptive delta modulators 
on a variety of speech signals have produced s/n's near or below the 
performance estimate suggested in this paper. It is further suggested 
that this estimate is very close to the upper bound on the performance 
of such coders. The s/n formula also provides an explanation of the 
disparities between s/n's taken on sine waves and those obtained 
while coding speech signals. 

A block diagram of a single-integration, adaptive delta modulator 
is shown in Fig. 1. At the encoder, the difference between an input 
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Fig. 1 — Single-integration delta modulation. 



sample, Xi, and the previous output sample, x,-i, is quantized to one 
of two levels and coded. The code symbols, sgn («,-) through sgn (Si- N ) 
(where N may be any positive integer) are then interrogated by the 
companding logic, and the step size, A,_i, is altered before the ith 
sample is encoded. The quantized approximation to the difference, 
Si = A, sgn (Si), is added to the previous output to obtain the present 
output sample. 

The decoder operates in the same manner as the encoder except that 
the circuit is driven from the transmission channel rather than from a 
local comparator. The quantized signal at the decoder, x[, is low-pass 
filtered to eliminate noise components outside the band of x t (i.e., 
frequencies greater than } LP ), and a replica of the input signal is thus 
regenerated at the desampling filter. 

The signal-to-noise ratios referred to in this paper were taken in 
the following manner. First the noise was obtained as shown in Fig. 2 
and then the ratio of input signal power to noise power was taken. The 
technique used by DeJager for sine wave s/n's is described in Ref. 5. 
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N = TOTAL NUMBER OF SPEECH 
SAMPLES OR 1500 TO 
3000 SINE WAVE SAMPLES. 



II. EXACT S/N FORMULAS 

The following equations are derived from the diagram in Fig. 1. 

8i = Xi — Xi-i (1) 

At = Xi-\ + 6j. (2) 

If the quantizing error is defined as 

ei = i, — x if (3) 

then, from (1) and (2), the following relationship holds: 

6i = 5, - Si. (4) 

From (3), it can be concluded that 

Xi = Xi + e i} (5) 

and likewise that 

.f,_i = xu-x + e,_i. (6) 

Therefore, (1) may be rewritten as 

hi = Xi - Xi-i - ei-i. (7) 

The average power in the prediction error is therefore 

E{^ = E(x$ + E(xUi) + E{eU) 

- 2E(x i x i - 1 ) - 2E(x i e i - 1 ) + 2£(z 1 _ 1 e,_ 1 ), (8) 

where the E functions are expected or average values. It is now noted 
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that, for quasi-stationary signals, 

£(a$ = E(xU); (9) 

and that 

E{eU) = E(e%).* (10) 

Therefore, eq. (8) may be reduced to 

E(8\) [. EjXjXj-i) EixiBj-x) Eixj-xet-x) ] E(e\) ( . 
E(3j~ I E(x\) E{&) +" E(x\) \^E(x* t ) K } 

The s/n at the quantizer is given as 

S/UQ = EU$ t - «,)»] ~ EJA) (12) 

The s/n before filtering is defined as 

s/n = eWY ( 3) 

Note that (11) is equal to (12) divided by (13) or that 

Ejb 2 ) _ s/n_o , U v 

E(x\) s/n ' * ; 

Hence, by substituting into (11) and transposing terms, an equation 
for the unfiltered s/n is obtained. 

s/n = 2[i - [E(x^x)yE{tfi - \J(wZ5l/M&) (15) 

+ lEiXi-tei-iWEix^y 

III. ASSUMPTIONS AND APPROXIMATE FORMULAS 

The variance of the prediction error is unknown because 5, contains 
quantizing noise [see (7)]. Therefore, 5, cannot be optimally quantized. 

No meaningful information can be obtained directly from eqs. (1) 
through (15) without making some approximations or assumptions 
about the unknown terms [s/n Q , E(x#i-i) and E {x i - X e i -i)']. Several 
measurements and simulations taken by the author and others before 
him support the following assumptions. 

(i) The optimum step size will yield the same signal-to-noise ratio 
at the quantizer that can be achieved by quantizing the noise- 
free part of 8, (i.e., the derivative of the input signal, x,- — x,_i). 



* To the extent that (9) and (10) are equations, (15) may be called an equation. 
Some awkward anomalies exist with regard to eq. (15); however, none of these is 
relevant to the problem. 
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(u) The quantizing noise is the same as that generated by optimum 
quantization of (.r, — x,_i), and therefore 

E[(xi - z,_i)e,_i] = 0. (16) 

Hence, 

tfCr.-e.-_i) - Eix^ei-y) = 0. (17) 

Given the above assumptions, (15) reduces to 

_ (s/n gop - 1) 

S/n 2{1 - ZE^-MEW)} ' {lb) 

where s/n 70p is the s/n achieved when z, — x,-i is optimally 
quantized. 
(in) Finally, in an optimum modulator the quantizing noise spec- 
trum is flat. Then the ratio of overall noise to the inband noise 
is equal to the ratio of half the sampling frequency to the band- 
width of the input signal. 

Hence, the s/n taken on the filtered signal, x' LP , is equal to the 
unfiltered s/n multiplied by the ratio of half the sampling frequency 
to the cutoff frequency of the filter. 

[s/n 90p -l][^l 

V *(*!> J 

where /„ is the sampling rate and /_p is the cutoff frequency of the 
desampling filter or the bandwidth of the input signal. 

Equation (19) is identical to Xitadori's signal-to-noise equation 6 
for differential pcm. Nitadori cautions against its use in cases where the 
quantization is coarse, however. In this paper, eq. (19) is derived using 
somewhat different assumptions which, in fact, do appear to hold for 
delta modulation. 

The validity of the three assumptions given above is the main point 
of this paper. When these assumptions hold, an important relationship 
between the amplitude distribution of the derivative of the input signal 
and s/n performance can be drawn. 

IV. RELATIONSHIP BETWEEN S/N,., AND PROBABILITY DENSITY 
FUNCTION OF X, - X w 

Paez and Glisson, 7 among others, have shown that the amplitude 
probability distribution of speech and its derivatives is closely ap- 
proximated by the gamma distribution. Figure 3 shows that this dis- 
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Fig. 3 — The amplitude probability density function of Or, — x,-_i) as compared 
with the Laplacian and gamma distributions. 

tribution closely approximates the probability distribution of 
(xi — .i\_i) for telephone signals used in my simulations. The dis- 
tribution of (xi — Xi-i), taken on the speech used in Jayant's simula- 
tions, lies closer to a Laplacian distribution, however. 



1468 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1975 
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Given a distribution that is symmetrical about the origin, the 
quantization step is optimum when 



r (y - A)P(y)dy = 0, 
Jo 



(20) 



where y relates to (z, — .r,_i). With the step set at the optimum size, 
Paez and Glisson, Max, 8 and others have calculated the noise power at 
the output of a two-level quantizer, 

ELM - 2/) 2 ] = 2 r (y - AYP(y)dy, (21) 

Jo 

and achieved the s/n's shown in Table I. 

V. COMPARISONS WITH SIMULATIONS 

The correlations between adjacent samples was taken on speech 
obtained using a carbon-button, telephone transducer. Similar data 
were obtained by N. S. Jayant on speech recorded from a high-fidelity 
transducer. Both signals were processed by Jayant's adaptive delta 
modulator with a one-bit memory, where the step size is multiplied 
by 1.5 if the present and previous code words, sgn (5,) and sgn (5,_i), 
are alike, or by 0.66 if they differ. In all the simulations, the sampling 
and desampling filter cutoff frequencies are set at 3.3 kHz, except for 
the telephone speech recorded at 24 kHz. In this case, the cutoff was 
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reduced to 3 kHz. The telephone signals sampled at 48 kHz were also 
encoded by a single-integration delta modulator designed by D. E. 
Blahut. 9 Blahut's encoder also performs close to the estimate [eq. 
(19)], when processing telephone speech. Among the numerous coders 
tested, no single-integration delta modulator was found that performs 
significantly better than Blahut's or Jayant's. 

In Table II, performance estimates based on eq. (19) are compared 
with the s/n's obtained using Blahut's and Jayant's delta modulators. 
To account for difference in probability density functions (see Fig. 3), 
the estimates were made with s/n gop equal to 1.5 for telephone signals, 
and 2.0 for high-fidelity signals. 

The performance estimate given by eq. (19) is within 3.3 dB of the 
s/n's obtained in simulations with Jayant's delta modulator. The 
s/n's taken on Jayant's and Blahut's coders, while processing telephone 
speech at 48 kHz and 24 kHz, are essentially equal to the estimate. In 
these cases, the signal level was carefully adjusted until optimum per- 
formance was obtained, then further data were taken to verify eq. (19). 
(See Table III.) 

The results shown in Table III lend great support to the approxima- 
tions made in deriving eq. (19). The noise terms do effectively cancel, 
leaving a residue that is at least an order of magnitude smaller than 
the noise-free terms in the denominator of (15) (see lines 5 and 6 in 
Table III). The estimates for noise rejection at the desampling filter 
and for quantizer performance (s/n Q ) are within 0.8 dB of the figures 
obtained in the simulations. 

Both coders were simulated with a 60-dB range of step sizes, and 
both were started with the step size equal to the minimum and the 



Table II — Performance estimates 





Sampling 
Rate (kHz) 


E{xiXi- X ) 
E(&) 


Estimate 10 log m 
(s/iup) (dB) 


Delta Modulator 
Performance (dB) 




Jayant's 


Blahut's 


2.0 
*1.5 

2.0 
*1.5 

2.0 


20 
24 
40 
48 
60 


0.989 

0.957 

0.997 

0.9897 

0.999 


21.3 

13.7+ 

30.0 

22.6 

36.5 


18.0 
14.5 
28.0 
22.9 
34.0 


22.7 



* Telephone speech : The acoustic-to-electronic response of the new 500-type, 
stations sets 10 indicates that signal components in the 100-Hz to 3.3-kHz band are 
differentiated, and that components below 100 Hz are severely attenuated. Hence, 
correlation between adjacent samples is lower for telephone speech than for high- 
fidelity speech. 

f At 24-kHz sampling, f L p = 3 kHz. 
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Table III- 
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Sam- 
pling 
Fre- 
quency 
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Estimates 


Coder Performance 




Jayant's 


Blahut's 


s/tilp 


24 

48 


13.7 dB 
22.6 dB 


14.5 dB 
22.9 dB 


22.7 dB 


s/n 


24 

48 


(i.e. 
(i.e. 


5.92 

7.72 dB) 
24.3 
13.9 dB) 


5.95 
(i.e., 7.75 dB) 

24.3 
(i.e., 13.9 dB) 


26.1 
(i.e., 14.2 dB) 


Noise rejection at the 
desampling filter 
- 101og,o(/./2/£p) 


24 
48 


6.0 dB 
8.7 dB 


6.8 dB 
9.0 dB 


8.5 dB 


s/ng 


24 

48 


1.5 
1.5 


1.531 
1.555 


1.569 


r is?(x,x,_ 1 ) -] 
L *(*» J 


24 

48 


— 


0.0422 
0.0103 


0.0103 


L E(x\) 

Eixi^m-i) "I 

E(x>) J 


24 

48 






-0.00238 
-0.00112 


-0.00060 



predictor voltage equal to zero. As the average input signal level was 
varied over a 40-dB range, it was found that the s/n varied by 3 dB. 
In either coder, it was found that when performance fell significantly 
below the estimate (19), the following phenomena were observed: 

(i) Quantizer performance and unfiltered s/n changed slightly 

(in some cases, these parameters increased in value). 
(ii) The noise terms no longer effectively canceled, 
(m) There was a dramatic reduction in noise rejection at the de- 
sampling filter. It appears that when the correlation between 
the difference signal, (zt — z»_i), and the noise (16) becomes 
significant, more noise must shift into the passband of the 
desampling filter. 

Hence, the approximations used in deriving eq. (19) do appear to 
describe the optimum condition. 

These results have been obtained using both an HP2100A mini- 
computer and an IBM 370, and therefore are repeatable. Moreover, 
further validation by others using other encoders is desirable. 
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VI. SINE WAVE PERFORMANCE 

Another interesting check on the theory is the fact that it explains 
why researchers everywhere achieve much higher s/n with sine wave 
inputs than with speech signals. DeJager's formula [see eq. (22)3 
indicates that the s/n taken on a sine wave at any frequency below 
3 kHz is greater than the s/n that we predict or obtain for telephone 
speech. 

s/nneJager = (0.04) j/j^ , (22) 

where / is the frequency of the input sine wave. 

The amplitude probability distribution and s/n, op for a sine wave 
were given in Table I. Substitution of the value in Table I into eq. (19) 
yields an estimate for sine wave s/n's. 

4-28 (^ 
s/n 8ine wave = j- Ejx-l.^ ' (23) 

L #(*<) J 

Equation (23), in turn, is approximately equivalent to DeJager's 
formula. This relationship can be shown as follows. Let x = sin {2wft) ; 
then 

. T [sin(2T/0]-sin(27r/* + 27r/// g )<ft 

ZiXi—i) Jo 



f sin 2 (2irft)dt 
Jo 



(24) 



= cos 



&)■ 



E(XjXj-i 
E(x% 

or 

E(XjXj-i) 
E(x\) 

If the delay angle, (2ir ///.), is sufficiently small, then 

When (26) is substituted into (23), we obtain something very close 
to DeJager's formula : 

s/n 8ine wave = (0.054) -Jf— (27) 

J JLP 

For / = 800 Hz, f LP = 3.3 kHz, and /. = 48 kHz, estimates of 33.5 
and 34.7 dB are obtained using (22) and (27). Under these same con- 
ditions, signal-to-noise readings of 26 to 27 dB were obtained in simula- 
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tions of Jayant's delta modulator. Under similar conditions, DeJager 6 
obtained a maximum s/n of about 30 dB on a linear delta modulator. 

VII. CONCLUSIONS 

The optimum performance of Blahut's and Jayant's delta modulators 
is very close to the estimate, (19), when processing speech signals. 
Further experimentation with step-size compandors, without a change 
in the prediction technique, will not produce significantly higher 
signal-to-noise ratios. Equation (19) applies to a delta modulator with 
a single, ideal integrator; therefore, it does not preclude improvements 
through the use of fixed, higher-order networks. 

In addition, it has been shown that delta modulator performance is 
dependent on the amplitude probability distribution of the derivative 
of the input signal. This dependence should be tested on a variety of 
signals and probability density functions. The theory also implies that 
a relationship exists between the amplitude distributions of differ- 
ential waves at the input and optimum s/n, when higher-order net- 
works are used. 

Finally, 1 wish to call attention to the fact that the s/n performance 
of a delta modulator is significantly less for telephone signals than 
for low-pass filtered, high-fidelity signals, or for sine waves. 
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